A Segmentation-free Approach to Recognise Printed Sinhala Script

Author

  • H. L. Premaratne
Abstract

The majority of character recognition algorithms, such as those using artificial neural networks (ANNs), need the script to be segmented prior to recognition. In contrast to Western scripts, Brahmi-descended South Asian scripts such as Sinhala contain modifier symbols, which make segmentation a difficult task that needs to be addressed as a separate issue. Further, changes in the shape of the basic character during modification (in violation of the modification rules) make some modified Sinhala characters impossible to segment. The proposed method uses Linear Symmetry to examine the correlation between characters in the script and the testing alphabet, and so recognises characters directly within the image of the script. A similar method is used to resolve confusing characters. Experiments show highly favourable results not only for the basic characters of the alphabet but also for the modifier symbols. A novel but simple method that uses Linear Symmetry for skew correction is also proposed.
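The abstract does not describe how Linear Symmetry is computed, so the following is only a minimal sketch of the commonly described complex-moment formulation, in which squared complex gradients are summed over a neighbourhood and half the argument of the sum gives the dominant orientation. It is not the paper's recognition or skew-correction procedure. The function names, the central-difference gradient, and the synthetic stripe image are all assumptions made for illustration.

```python
import cmath
import math


def linear_symmetry_orientation(image):
    """Estimate the dominant gradient orientation of a grey-level image.

    `image` is a 2-D list of numbers. Returns (angle, certainty):
    angle is in radians within [-pi/2, pi/2]; certainty is |I20| / I11,
    which is close to 1 when all gradients share one direction.
    Hypothetical helper, not taken from the paper.
    """
    rows, cols = len(image), len(image[0])
    i20 = 0j   # sum of squared complex gradients
    i11 = 0.0  # sum of their magnitudes
    for y in range(1, rows - 1):
        for x in range(1, cols - 1):
            # Central differences (an assumed, simple gradient filter).
            fx = (image[y][x + 1] - image[y][x - 1]) / 2.0
            fy = (image[y + 1][x] - image[y - 1][x]) / 2.0
            z = complex(fx, fy) ** 2  # squaring doubles the angle
            i20 += z
            i11 += abs(z)
    if i11 == 0.0:
        return 0.0, 0.0  # flat image: no orientation information
    return 0.5 * cmath.phase(i20), abs(i20) / i11


def make_stripes(rows, cols, theta, k):
    """Synthetic test image: a cosine wave whose gradient points along theta."""
    c, s = math.cos(theta), math.sin(theta)
    return [[math.cos(k * (x * c + y * s)) for x in range(cols)]
            for y in range(rows)]


if __name__ == "__main__":
    stripes = make_stripes(40, 40, math.radians(20.0), 0.3)
    angle, certainty = linear_symmetry_orientation(stripes)
    print(math.degrees(angle), certainty)
```

Squaring the complex gradient makes opposite gradient directions reinforce rather than cancel, which is why the sum can be averaged over a whole region. For skew estimation, an angle of this kind could in principle be measured on a scanned page and the page rotated to compensate, but whether the paper does so in this way is not stated in the abstract.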

Similar resources

A segmentation-free approach to recognise printed Sinhala script using linear symmetry

In this paper, a novel approach for printed character recognition using linear symmetry is proposed. When the conventional character recognition methods such as the artificial neural network based techniques are used to recognise Brahmi Sinhala script, segmentation of modified characters into modifier symbols and basic characters is a necessity but a complex issue. The large size of the character ...

Lexicon and hidden Markov model-based optimisation of the recognised Sinhala script

The Brahmi-descended Sinhala script is used by 75% of the 18 million population in Sri Lanka. To the best of our knowledge, none of the Brahmi-descended scripts, which are used by hundreds of millions of people in South Asia, has a commercial OCR product. In the process of implementation of an OCR system for the printed Sinhala script which is easily adaptable to similar scripts [Premaratne, L., Assabi...

Recognition of Printed Sinhala Characters Using Linear Symmetry

Sinhala characters, used in the Sinhala script by over 70% of the 18 million population in Sri Lanka, are descended from the ancient Brahmi script. The Sinhala alphabet consists of vowels and consonants, and the consonants are modified using modifier symbols to give the required vocal sounds. In the process of developing an OCR for the Sinhala script, characters are initially recognised thr...

Recognition of Modification-based Scripts Using Direction Tensors

Research on OCR technology for Latin-based scripts has been successful enough that image scanners now come with a built-in OCR facility. However, the majority of modification-based scripts, such as the Brahmi-descended South Asian scripts or the Ethiopic script, are still progressing towards this status. This indicates the difficulties in adopting the recognition methods that have been proposed so f...

A Neural Network Based Character Recognition System for Sinhala Script

Much effort has been expended in making a computer recognise both typed and handwritten characters automatically. Until quite recently, the focus of this endeavour has been on characters of the English language. Asian languages such as Sinhala and Tamil have been given little or no attention. Methods currently widely used for character recognition for these languages are mainly those which i...

Publication date: 2004